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Abstract 

The spectral bound, s(a4 + /8V), of a combination of 
a resolvent positive linear operator A and an operator 
of multiplication V, was shown by Kato to be convex 
in j3 G R. This is shown here to imply, through an 
elementary lemma, that s(aA + fiV) is also convex in 
a > 0, and notably, d s(aA + /3V)/da < s(A) when it 
exists. Diffusions typically have s(A) < 0, so that for 
diffusions with spatially heterogeneous growth or decay 
rates, greater mixing reduces growth. Models of the 
evolution of dispersal in particular have found this result 
when A is a Laplacian or second-order elliptic operator, 
or a nonlocal diffusion operator, implying selection for 
reduced dispersal. These cases are shown here to be part 
of a single, broadly general, 'reduction' phenomenon^ 

Keywords: spectral bound - - reduction principle 
— evolution of dispersal — nonlocal dispersal — nonlocal 
diffusion 

The main result to be shown here is that the growth 
bound, uj(mA + V), of a positive semigroup generated by 
mA + V changes with positive scalar m at a rate less than 
or equal to ui(A), where A is also a generator, and V is an 
operator of multiplication. Movement of a reactant in a 
heterogeneous environment is often of this form, where V 
represents the local growth or decay rate, and m represents 
the rate of mixing. Lossless mixing means u(A) = 0, while 
lossy mixing means ui(A) < 0, so this result implies that 
greater mixing reduces the reactant's asymptotic growth 
rate, or increases its asymptotic decay rate. This is a 
familiar result when A is a diffusion operator, so what 
is new here is the generality shown for this phenomenon. 
At the root of this result is a theorem by Kingman on 
the 'superconvexity' of the spectral radius of nonnegative 
matrices [T]. The logical route progresses from Kingman 
through Cohen 2J to Kato j3] . The historical route begins 
in population genetics. 

In early theoretical work to understand the evolution 
of genetic systems, Feldman and colleagues kept finding 
a common result from each model they examined [H 



ED 12 El US EI] — be they models for the evolution of 
recombination, or of mutation, or of dispersal. Evolution 
favored reduced levels of these processes in populations 
near equilibrium under constant environments, and this 
result was called the Reduction Principle [10] . 

These results were found for finite-dimensional models. 
But the same reduction result has also been found in mod- 
els for the evolution of unconditional dispersal in continu- 
ous space, in which matrices are replaced by linear oper- 
ators. This raises the questions of whether this common 
result, discovered in such a diversity of models, reflects a 
single mathematical phenomenon. Here, the question is 
answered affirmatively. 

The mathematical underpinnings of the reduction prin- 
ciple for finite-dimensional models were discovered by Sam 
Karlin [12l [13] (although he did not realize it, and he had 
earlier proposed an alternate to the reduction principle — 
the mean fitness principle |14) . which was found to have 
counterexamples [T5] ) . Karlin wanted to understand the 
effect of population subdivision on the maintenance of ge- 
netic variation. Genetic variation is preserved if an allele 
has a positive growth rate when it is rare, protecting it 
from extinction. The dynamics of a rare allele are approx- 
imately linear, and of the form 



x(m) = [(l-m)I + mP]Dx(i) 



(1) 



where x(t) is a vector of the rare allele's frequency among 
different population subdivisions, m is the rate of dispersal 
between subdivisions, P is the stochastic matrix represent- 
ing the pattern of dispersal, and D is a diagonal matrix 
of the growth rates of the allele in each subdivision. The 
allele is protected from extinction if its asymptotic growth 
rate when rare is greater than 1. This asymptotic growth 
rate is the spectral radius, 



p(A) := max{|A| : A E er(A)}, 



(2) 



where er(A) is the set of eigenvalues of matrix A. 

Karlin discovered that for M(m) := [(1— m)I + mP], 
the spectral radius, p(M(m)D), is a decreasing function 
of the dispersal rate m, for arbitrary strongly-connected 
dispersal pattern: 



1 Dedicated to Sir John F. C. Kingman on the fiftieth anniversary 
his theorem on the 'superconvexity' of the spectral radius [T], which 
is at the root of the results presented here. 



Theorem 1 (Karlin Theorem 5.2, [HI pp. 194 196]) 

Let P be an arbitrary non-negative irreducible stochastic 
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matrix. Consider the family of matrices 

M(a) = (l-a)I + aP. 

Then for any diagonal matrix D with positive terms on 
the diagonal, the spectral radius 

p{a) = p(M(a)D) 

is decreasing as a increases (strictly provided D =/= dl). 

Theorem 5.2 means that greater mixing between subdi- 
visions produces lower p(M(m)D), and if it crosses below 
1, the allele will go extinct. While this theorem was mo- 
tivated by the issue of genetic diversity in a subdivided 
population, the generality of its form applies to any situ- 
ation where differential growth is combined with mixing. 
D could just as well represent the investment returns on 
different assets and P a pattern of portfolio rebalancing. 
Or D could represent the decay rates of reactant in differ- 
ent parts of a reactor, and P a pattern of stirring within 
the reactor. In a very general interpretation, Theorem 
5.2 means that greater mixing reduces growth and hastens 
decay. 

If the dispersal rate m is not an extrinsic parameter, 
but is a variable which is itself controlled by a gene, then a 
gene which decreases m will have a growth advantage over 
its competitor alleles. The action of such modifier genes 
produces a process that will reduce the rates of dispersal 
in a population. Therefore, Theorem 5.2 also means that 
differential growth selects for reduced mixing. 

In the evolutionary context, the generality of the mixing 
pattern P in Karlin's Theorem 5.2 makes it applicable to 
other kinds of 'mixing' besides dispersal. The pattern ma- 
trix P can just as well refer to the pattern of mutations be- 
tween genotypes, and then m refers to the mutation rate. 
Or P can represent the pattern of transmission when two 
loci recombine, and then m represents the recombination 
rate. The early models for the evolution of recombina- 
tion and mutation that exhibited the reduction principle 
in fact had the same form as ([1]) for the dynamics of a rare 
modifier allele. Once this was recognized [16l [XZl [18] , it 
was clear that Karlin's theorem explained the repeated ap- 
pearance of the reduction result in the different contexts, 
and generalized the result to a whole class of genetic trans- 
mission patterns beyond the special cases that had been 
analyzed. 

The dynamics of movement in space have been long 
modeled by infinite-dimensional models, where space is 
continuous and the concentrations of a quantity at each 
point are represented as a function. The dynamics of 
change in the concentration are modeled as diffusions, 
where the Laplacian or elliptic differential operator or non- 
local integral operator takes the place of the matrix P in 
the finite-dimensional case. When the substance grows or 



decays at rates that are a function of its location, the sys- 
tem is often referred to as a reaction-diffusion. In reaction- 
diffusion models for the evolution of dispersal, the reduc- 
tion principle again makes its appearance [19] [201 Lemma 
5.2] [2TJ Lemma 2.1][22]. In nonlocal diffusion models, 
again the reduction principle appears |23| . This points to 
the possibility of an underlying mathematical unity. 

Here, a broad characterization of this 'reduction phe- 
nomenon' is established by generalizing Karlin's theorem 
to linear operators. The reduction results previously found 
for various linear operators are, therefore, shown to be spe- 
cial cases of a general phenomenon. 

This result is actually implicit in Kato's generalization 
[3] of Cohen's theorem [2] on the convexity of the spectral 
bound of essentially nonnegative matrices with respect to 
the diagonal elements of the matrix. It is educed from 
Kato's theorem here by means of an elementary 'dual con- 
vexity' lemma. 

Kato's goal in [3] was to generalize, from matrices to 
linear operators, Cohen's convexity result [2]: 

Theorem 2 (Cohen ]2j) Let D be diagonal real n x n 
matrix. Let A be an essentially nonnegative n x n matrix. 
Then ,s(A+D) is a convex function o/D. 

Here, s(A+D) is the spectral bound — the largest real 
part of any eigenvalue of A+D. A synonym for the spec- 
tral bound used in the matrix literature is the spectral 
abscissa [24] [25] [26]. When the spectral bound is an 
eigenvalue, it is also referred to as the principal eigenvalue 
[27] , dominant eigenvalue [28] , dominant root [29] , Perron- 
Frobenius eigenvalue [30] . or Perron root [31 . 'Essen- 
tially nonnegative' means that the off-diagonal elements 
are nonnegative. Synonyms include 'quasi-positive' {32] , 
'Metzler', 'Metzler-Leontief ', 'ML' 31) . and 'cooperative' 
[33]: 

Cohen's proof relied upon the following theorem of 
Kingman: 

Theorem 3 (Kingman [T]) Let A be an n x n matrix 
whose elements, Aij{6), are non-negative functions of the 
real variable 9, such that they are 'superconvex', i.e. for 
each i,j, either log Aij(6) is convex in 9, or Aij(9) = 
for all 9. 

Then the spectral radius of A is also superconvex in 9. 

Kato generalized Cohen's result to linear operators by 
first generalizing Kingman's theorem. Before presenting 
Kato's theorem, some terminology needs to be introduced: 

X represents an ordered Banach space or its complexifi- 
cation. 

X + represents the proper, closed, positive cone of X, as- 
sumed to be generating and normal (see [3]). 
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B(X) represents the set of all bounded linear operators 
A: X \-+X. 

A is a positive operator if AX + C X + . 

The resolvent of A is R(£,A) := (£ — A)^ 1 , the operator 
inverse of £ — A, £ G C. 

The resolvent set q{A) C C are those values of £ for which 
£ — A is invertible. 

The spectrum of A G B(X), o~(A), is the complement of 
the resolvent set, g{A). 

The spectral bound of closed linear operator A, not neces- 
sarily bounded, is 



s(A) 



sup{Re(A) : A G a {A)} if a {A) ^ 
-oo ifo-(A) = 0. 



The type (growth bound) of an inhnitesimal generator, A, 
of a strongly continuous (Co) semigroup, {e tA : t > 
0}, is 



u)(A) := lim - log \\e 

t— >oo t 



Generally, — oo < s(A) < oj{A) < +oo, but condi- 
tions for s(A) = uj(A) or s(A) < ui(A) are part of 
a more involved theory for the asymptotic growth of 
semigroups (see [31]). 

Definition 1 A is resolvent positive if there is £o such 
that (£o, oo) G g(A) and _R(£, A) is positive for all £ > £o 



The relationship of the resolvent positive property to 
other familiar operator properties includes the following 
list of key results: 

1. If A generates a Co-semigroup T t , then T t is positive 
for alH > if and only if A is resolvent positive 36, 
p. 188]. 

2. If A is a resolvent positive operator defined densely on 
X = C(S), the Banach space of continuous complex- 
valued functions on compact space S, then A gener- 
ates a positive Co-semigroup [3J)J Theorem 3.11.9]. 

3. If A is resolvent positive and its domain, D(A) C X, 
is dense in X, then for every / G D(A 2 ), there exists 
a unique solution, u(t) G D(A) for all t > 0, u G 
C 1 ([0, oo), X), to the Cauchy problem [35] Theorem 
7.1] 

°— = Au(t) (t > 0), u(0) = /. 

4. If A is resolvent positive then: s(A) < +oo; if cr(A) 
is nonempty, i.e. — oo < s(A), then s(A) G o~(A); if 
£ G If! g(A) yields R(£, A) > then £ > s(A) [3] (36] 
Proposition 3.11.2]. 



5. Differential operators higher than second order are 
never resolvent positive [37, Corollary 2.3] [38 . 

6. Particular cases of resolvent positive operators in- 
clude 



(a) second-order elliptic operators 
d 2 



A = ^ a jk(x) 



j,k=l 



dxjdxk 



where the matrix [ a jk{x)]™ k _ 1 is symmetric and 
positive-definite for each x, and appropriate reg- 
ularity conditions hold for the domain and coef- 
ficients (e.g. [35], [3], @0]). 

(b) Linear integral operators A on X = C(fi) de- 
fined by 



(Af)(x) 



K(x,y)f(y)dy + b(x)f(x), 



where K G C(Tt x n,M+), Q C R n is bounded, 
and if (x, y) > 0, 6(x) are measurable functions 
for x, y G [231 HU S2]- A resolvent positive 
combination of integral and differential operator 
is analyzed in |43| . 

Kato's generalization of Cohen's theorem is as follows. 



Theorem 4 (Generalized Cohen's theorem [3]) 

Consider X = C(S) (continuous functions on a com- 
pact Hausdorff space S) or X — L P (S), I < p < +oo, 
on a measure space S , or more generally, let X be the 
intersection of two LP -spaces with different p's and differ- 
ent weight functions. Let A: X i-> X be a linear operator 
which is resolvent positive. Let V be an operator of mul- 
tiplication on X represented by a real-valued function v, 
where v G C(S) for X = C(S), or v G L 00 ^) for the 
other cases. 

Then s(A+V) is a convex function ofV. Lf in particular 
A is a generator, then both s(A + V) and u(A + V) are 
convex in V . 

Results 

Theorem 5 (Generalized Karlin's theorem) 

Let A be a resolvent positive linear operator, and V be an 
operator of multiplication, under the same assumptions as 
Theorem [^J 

Then for m > 0, 

1. s(m A + V) is convex in m; 

2. For each m > 0, either 

(a) s{[m+d)A+V) < s(mA+V)+ds(A) V d > 0, 
or 
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(b) s((m + d)A + V) = s{mA + V) + ds(A) V d > 0; 

3. In particular, when s(A) — then s(m A + V) is non- 
increasing in m (the 'reduction phenomenon'), and 
when s(A) < then s(mA + V) is strictly decreasing 
in m; 



4- Whenever- — s(mA + V) exists, then 
dm 



— s(mA + V) < s(A). 
dm 



(3) 



If A is a generator of a Cq -semigroup, then the above re- 
lations on s(mA + V) also apply to the type Lu(mA + V). 

Proof: We consider the general form 

4>(a, P) := s{aA + /3V) or w(ai + /3V) (4) 

where a > 0, (3 G R. Kato [3] explicitly shows that <j)(l,/3) 
is convex in /3 (which he points out is equivalent to vary- 
ing V). Lemma Q] (to follow) shows that this implies the 
properties asserted above regarding the effects of varying 
m on s(m A + V) = 0(m, 1). □ 

Lemma 1 (Dual Convexity) Let / : R x R 4 I, fee 

jointly continuous. For x > and y > 0, Zei / /iai>e tte 
following properties: 

f(ax, ay) = af(x, y), for a > 0, (5) 

and 

/(x,y) is convex in y. (6) 

TTien /or a; > 0: 

1. f(x,y) is convex in x, for y > 0; 

2. For each x > 0, either 

(a) /(x + d,l) < f(x,l) + df(l,0) V d> 0; or 
f&J /(x + d,l) = /(i 1 l) + d/(l,0)V<i>0; 

3. When it exists, g-f{x, 1) < /(1,0). 

If, in the above, /(x, y) is strictly convex in y, then /(x, y) 
is strictly convex in x , and f(x+d, 1) < f(x, l)+df(l, 0). 

The results is unchanged if the inequalities on y are re- 
versed. 

Proof: 

[TJ f(x, y) is convex in x, for y > 0. 

The relation f(ax, ay) = af(x, y) allows a set of rescal- 
ings that transform convexity in y into convexity in x. 
It is perhaps worth noting that this relation is actually 
a homomorphism, which can be put into a more familiar 



form by defining product x -k y := f(x,y), and function 
ip(x) := ax, which gives ip( x ) = VK 2 -*?/)- 

For the following derivations, the constraints are y ^ 0, 
2/1; Vi 7^ have the same sign as y, and < m < 1. 

These restrictions are made so that {y,yi,y2,m,l—m, 
(1— m)j/i + 7TM/2} are nonzero and all have the same sign, 
so that division with them is defined, and their ratios do 
not change sign when the sign of y is reversed. 

Convexity of / in y gives 

(l-m)f(x,y 1 ) + mf(x,y 2 ) > f(x, (l-m)yi + my 2 ), (7) 

for m G (0,1), yi ^ y 2 - Using ([5]) with substitutions 
a = yi/y, a = y 2 /y, and a = [(l-m)yi + my 2 ]/y in the 
terms in |7]), where y €E (0, 00), yields: 

(1 -m)— /( — ,y) +m—f{ — ,y) 

y vx y V2 



(l-m)yi + mz/2 ,/_ 

7 Ml 



xy 



I) 



(l-m)yi + my 2 



(8) 



Let xi := xy/yi and x 2 := xy/y 2 represent the rescaled 
arguments for / on the left side of ©. We see that xi, x 2 S 
(0,oo) since x € (0, 00) and y,y\,y 2 ^ have the same 
sign. 

We try the ansatz that X\ and x 2 can be combined con- 
vcxly to yield the third rescaled argument on the right side 
of©: 

^ = (l-h) Xl +hx 2 =(l-h)^-+h^-. 

{l-m)yi+my 2 y x y 2 



The ansatz has solution 

_ my 2 

(1— m)yi + my 2 ' 



and 1 — h 



(l-m)y 1 



(l-m)yi +my 2 



Note that h G (0, 1) is assured because yi and y 2 have the 
same sign, y\ ^ y 2 , and m G (0, 1). 

Define 4> := [(1— m)yi + my 2 ]/y. Then 4> > since 
J/j2/1:J/2 all have the same sign. Substitution gives 
(1— m)yi/y = (l—h)(f>, and my 2 /y = h<fi, and ([8]) becomes: 

(l-hWix^y) + Wf(x 2 ,y) > #((l-/i)x! + ftx 2) y). 

After dividing both sides by <p > 0, 

(l-h)f(xx,y)+hf(x 2 ,y) > f((l-h)xx + hx 2 , y), (9) 

which is convexity in x. The case of strict convexity 
follows by substituting > for > throughout. 

[21 Either f(x + d, 1) < /(x, 1) + d/(l,0) V d > 0, or 
f(x + d,l) = /(x,l)+d/(l,0) Vrf>0. 

The strategy will be to show first that /(x + d, 1) < 
/(x, 1) + d /(l, 0). Next, it is shown that if f(x + d,l) < 
/(x, l)+d /(l, 0) for any d > 0, then it is true for all d > 0, 
because convexity prevents f(x + d, 1) from ever returning 
to the line /(x, 1) + d /(l, 0) for d > 0. 
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By ([5]), for x, d > 0, we have the equivalences 
f(x + d,l) < f(x,l)+df(l,0) 
(x + d)f(l,^)< */(l,i)+d/(l,0) 



fO-,0). (li) 



Since the y arguments for f(x,y) in (111|) are related by 
convex combination, 



1 



x + d x + d x 



a; + d 



*0, 



then (jlip is just a statement of the convexity of /(a;, y) in 
y, as hypothesized. The case of strict convexity follows by 
substituting < for <, throughout. 

Now, given x > 0, suppose that for some d\ > 0, 



Remark 1 It would be clearly desirable to characterize 
the conditions for strict convexity in Kato's theorem, so 
(10) that by Lemma [TJ one would obtain strict convexity in 
Theorem [5] itcm[TJ and strict monotonicity in items [3] and 
HI Indeed, item is the best that can be offered in the 
way of strict inequality without strict convexity. But the 
problem is more technical and is deferred to elsewhere. 

It is reasonable, nevertheless, to conjecture that the 
properties which produce strict convexity in the matrix 
case [331 Theorem 4.1] [15] Theorem 1.1] extend to their 
Banach space versions: i.e. for a > 0, when resolvent pos- 
itive operator A is irreducible [46] p. 250] (47j p. 41], then 
s(aA + (3V) is strictly convex in f3 if and only if V is not 
a constant scalar. 

A Third Proof of Karlin's Theorem 5.2 



f(x + dr,l) < /(*,!) + d 1 /(l,0). 



(12) 



We shall see that convexity then prevents f(x + d, 1) from 
ever returning to the line f(x, 1) + df (1,0) for d > 0. 

We consider five points: x < x + do < x + d\ < x + d 2 < 
x + g?3 < oo. For readability, write g(x) = f(x, 1) and 
F = /(l, 0). By convexity ©, and hypothesis 

g{x + d ) < (l - ^-)g(x) + j-g(x + di) 

< (1 - ^)g(x) + ^(g(x) + d x F) = g(x) + d F. 
and, by ©, (THJl, and (TTUJ) (line 3 below), 
g(x + d 2 ) < ^ — -rg{x + di) + ~r — -rg( x + rf 3) 

"3 — «1 «3 — "1 

d 3 - di da - d\ 

< y^t(9(x) + d lF ) + %^-(g(x) + d 3 F) 

«3 — d\ «3 — «1 

,g(.T + ff 2 )(rf 3 - di) 

< (d 3 - da) (^(a:) + diF) + (da - di)(s(a?) + dsi 71 ) 
= (dg - di)g(x) + d 2 (d 3 - di)F 
g(x + d 2 ) < g(x) + d 2 F. 

d 

O When it exists, faffal) < fO-,0). 



Rearrangement of (fTO]) gives 

f(x + d,l)-f(x,l) 



</(l,0). 



Hence when the limit exists, 

,. f(x + d,l)-f(x,l) df(x,l) 



lim 



d 



dx 



</(l,0). 



Karlin's proof was based on the Donsker-Varadhan vari- 
ational formula for the spectral radius [IS] . Kirkland et al. 
[4*U] recently discovered another proof using entirely struc- 
tural methods. A third distinct proof of Karlin's theorem 
is seen here by application of Lemma [T] to Cohen's the- 
orem, combined with Friedland's equality condition |441 
Theorem 4.1] (see also [55] for a different proof), as fol- 
lows. 

The expression in Karlin's Theorem 5.2 can be put in 
the form used in Theorem [5] 

M(m)D = [{l-m)l + mP]D = m(P-I)D + D 

= aA + /3D, 

where A = (P— I)D, a — m, and (3 = 1. 

Since e T (P-I)D = (e T - e T )D = 0, we see that 
s(A) = s((P— I)D) = 0. Cohen's theorem gives that 
s(aA + /3D) is convex in (3, and thus by Lemma [U 
s(aA + /3D) is convex and non-increasing in a. Applica- 
tion of Lemma[T]therefore yields that jo(M(m)D) is convex 
and non-increasing in m, and dp(M(m)D)/dm < 0, the 
derivative existing for all to > when M(m)D is irre- 
ducible. 

From Friedland (44] Theorem 4.1], strict convexity in j3 
occurs if P is irreducible and D ^ cl, for any c > 0. By 
Lemma Q] this implies s(M(m)) is strictly decreasing and 
strictly convex in to. □ 

Remark 2 The core of Kirkland et al.'s proof is their 
Lemma 4.1, which can be stated as 

e T A (u(A) o v(A)) > u(A) T Av(A) = s(A), 

with equality only when e T A = s(A)e T , where u(A) T 
and v(A) are the left and right eigenvectors of A associ- 
ated with the Perron root s(A), and u o v is the compo- 
nentwise (Schur-Hadamard) product. Without the equal- 
□ ity condition, their result is a special case of [50] Theorem 
3.2.5], but to obtain the equality condition requires an 
approach their novel proof provides. 
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Remark 3 Schreiber and Lloyd-Smith [51, Appendix B, 
Lemma 1] followed the reverse path and extended Kirkland 
et al's result on s(M(m)D) to the form s(aA+D), where 
A is essentially nonnegative and D any diagonal matrix. 



Lemma [T] can also be used as a new proof of an in- 
equality of Lindqvist, the special case considered in |52[ 
Theorem 2, pp. 260-261]. 
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Theorem 6 (Lindqvist [52, Theorem 2, subcase]) 

Let A be an irreducible n x n real matrix such that 1 ) 
Aij > for i ^ j , and 2) The left and right eigenvectors 
of A, u(A) T and v(A), associated with eigenvalue s(A), 
satisfy u(A) T v(A) = 1. Let D be an n x n real diagonal 
matrix. Then 



s(A+D) - s(A) > u(A) T Dv(A). 



(13) 



Proof: Since A is an essentially nonnegative matrix, s(A) 
is an eigenvalue of multiplicity 1. Consider the representa- 
tion A = mB — D, where B is essentially nonnegative and 
m > 0. Write s = s(A). As A is irreducible, u = u(A), 
v = v(A), with u T v = 1, e T v = 1, are unique, and the 
derivatives exist [26] in the following [53J Sec. 9.1.1]: 



u 



-d(Av) 
dm 



u 



<9A 



dv 



dm ~*~ dm 



u T Bv + su 1 



u T ^— (sv) 
dm 



dv 
dm 
dv 



ds 

V + s 

dm dm 



ds_ 

dm 



s u 



dv 

dm 



Cancellation of terms sv^dv/dm gives 

^ = u(A) T f^v(A) = u(A) T Bv(A) < S (B), 
dm dm 

the inequality coming from Lemma [TJ Scaling by m, sub- 
tracting D, and substituting mB = A + D, we get 

u(A) T (mB - D)v(A) = s(A) < s(mB) - u(A) T Dv(A) 
^ u(A) T Dv(A) < s(A+D) - s(A). □ 
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